SEQUEL - 2014 - Annual activity report

SEQUEL

SEQUEL - 2014

Project-Team Sequel

Members

Overall Objectives

Presentation

Research Program

Application Domains

New Software and Platforms

New Results

Bilateral Contracts and Grants with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Previous |

Home | Next next

Section: Partnerships and Cooperations

European Initiatives

FP7 & H2020 Projects

CompLACS

Type: FP7
Defi: Cognitive Systems, Interaction, Robotics
Instrument: Specific Targeted Research Project
Objectif: Cognitive Systems and Robotics
Duration: March 2011 - February 2015
Coordinator: John Shaw-Taylor
Partner: University College London, University of Bristol, Royal Holloway, University of London, Radboud Universiteit Nijmegen, Technische Universitat Berlin, Montanuniversitat Leoben, Institut National de Recherche en Informatique et en Automatique, Technische Universität Darmstadt
Inria contact: Rémi MUNOS
Abstract: One of the aspirations of machine learning is to develop intelligent systems that can address a wide variety of control problems of many different types. However, although the community has developed successful technologies for many individual problems, these technologies have not previously been integrated into a unified framework. As a result, the technology used to specify, solve and analyse one control problem typically cannot be reused on a different problem. The community has fragmented into a diverse set of specialists with particular solutions to particular problems. The purpose of this project is to develop a unified toolkit for intelligent control in many different problem areas. This toolkit will incorporate many of the most successful approaches to a variety of important control problems within a single framework, including bandit problems, Markov Decision Processes (MDPs), Partially Observable MDPs (POMDPs), continuous stochastic control, and multi-agent systems. In addition, the toolkit will provide methods for the automatic construction of representations and capabilities, which can then be applied to any of these problem types. Finally, the toolkit will provide a generic interface to specifying problems and analysing performance, by mapping intuitive, human-understandable goals into machine-understandable objectives, and by mapping algorithm performance and regret back into human-understandable terms.

Previous |

Home | Next next